Hybrid MPI/OpenMP parallelization in FETI-DP methods
نویسندگان
چکیده
We present an approach to hybrid MPI/OpenMP parallelization in FETIDP methods using OpenMP with PETSc in the finite element assembly and using the shared memory parallel direct solver Pardiso in the FETI-DP solution phase. Our approach thus uses OpenMP parallelization on subdomains and MPI in between subdomains. We investigate the efficiency of this approach for a benchmark problem from two dimensional nonlinear hyperelasticity. We observe good scalability for up to 4 threads for each MPI rank on a state-of-the-art Ivy Bridge architecture and incremental improvements for up to 10 OpenMP threads for each MPI rank.
منابع مشابه
Advanced Hybrid MPI/OpenMP Parallelization Paradigms for Nested Loop Algorithms onto Clusters of SMPs
The parallelization process of nested-loop algorithms onto popular multi-level parallel architectures, such as clusters of SMPs, is not a trivial issue, since the existence of data dependencies in the algorithm impose severe restrictions on the task decomposition to be applied. In this paper we propose three techniques for the parallelization of such algorithms, namely pure MPI parallelization,...
متن کاملParallelization of the Treecode Algorithm for N-Body Simulation Using MPI, Hybrid, and GridRPC Programming Paradigms
This dissertation describes the parallelization of the treecode algorithm for N-Body problem and performance comparison among three different parallel programming paradigms, MPI, hybrid MPI-OpenMP, and GridRPC. In N-Body simulation, the specific routine for calculating the forces on the bodies which accounts for upwards of 90% of the cycles in typical computations is eminently suitable for obta...
متن کاملParallelization of a finite difference scheme for solving systems of 2D Sine-Gordon equations
Systems of perturbed 2D Sine-Gordon equations coupled via a cyclic tridiagonal matrix are solved numerically by a second-order centered finite difference scheme. The systems are considered on rectangular domains. First an OpenMP parallel program is realized and very good performance scalability inside one computational node is achieved. The tests on one computational node of the CPU platform in...
متن کاملEfficiency of Large-scale Cfd Simulations on Modern Supercomputers Using Thousands of Cpus and Hybrid Mpi+openmp Parallelization
This work represents an experience in using the hybrid parallel model to perform large-scale DNS. Advantages of the hybrid approach compared to the MPI-only approach are presented and discussed. The use of OpenMP in addition to MPI is demonstrated for modelling of compressible and incompressible flows using both structured and unstructured meshes. A parallel Poisson solver for incompressible fl...
متن کاملA Hybrid Parallelization of AIM for Multi-Core Clusters: Implementation Details and Benchmark Results on Ranger
This paper presents implementation details and empirical results for a hybrid message passing and shared memory paralleliziation of the adaptive integral method (AIM). AIM is implemented on a (near) petaflop supercomputing cluster of quad-core processors and its accuracy, complexity, and scalability are investigated by solving benchmark scattering problems. The timing and speedup results on up ...
متن کامل